NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Incorporating Taxonomic Reasoning and Regulatory Knowledge into Automated Privacy Question Answering

https://doi.org/10.1007/978-981-96-0579-8_31

Ravichander, Abhilasha; Yang, Ian; Chen, Rex; Wilson, Shomir; Norton, Thomas; Sadeh, Norman (November 2024, Springer Nature Singapore)

Privacy policies are often lengthy and complex legal documents, and are difficult for many people to read and comprehend. Recent research efforts have explored automated assistants that process the language in policies and answer people’s privacy questions. This study documents the importance of two different types of reasoning necessary to generate accurate answers to people’s privacy questions. The first is the need to support taxonomic reasoning about related terms commonly found in privacy policies. The second is the need to reason about regulatory disclosure requirements, given the prevalence of silence in privacy policy texts. Specifically, we report on a study involving the collection of 749 sets of expert annotations to answer privacy questions in the context of 210 different policy/question pairs. The study highlights the importance of taxonomic reasoning and of reasoning about regulatory disclosure requirements when it comes to accurately answering everyday privacy questions. Next we explore to what extent current generative AI tools are able to reliably handle this type of reasoning. Our results suggest that in their current form and in the absence of additional help, current models cannot reliably support the type of reasoning about regulatory disclosure requirements necessary to accurately answer privacy questions. We proceed to introduce and evaluate different approaches to improving their performance. Through this work, we aim to provide a richer understanding of the capabilities automated systems need to have to provide accurate answers to everyday privacy questions and, in the process, outline paths for adapting AI models for this purpose.
more » « less
Full Text Available
Incorporating Taxonomic Reasoning and Regulatory Knowledge into Automated Privacy Question Answering

Ravichander, Abhilasha; Yang, Ian; Chen, Rex; Wilson, Shomir; Norton, Thomas; Sadeh, Norman (November 2024, International Web Information Systems Engineering)

Full Text Available
Requirements Satisfiability with In-Context Learning

https://doi.org/10.1109/RE59067.2024.00025

Santos, Sarah; Breaux, Travis; Norton, Thomas; Haghighi, Sara; Ghanavati, Sepideh (June 2024, IEEE)

Language models that can learn a task at inference time, called in-context learning (ICL), show increasing promise in natural language inference tasks. In ICL, a model user constructs a prompt to describe a task with a natural language instruction and zero or more examples, called demonstrations. The prompt is then input to the language model to generate a completion. In this paper, we apply ICL to the design and evaluation of satisfaction arguments, which describe how a requirement is satisfied by a system specification and associated domain knowledge. The approach builds on three prompt design patterns, including augmented generation, prompt tuning, and chain-of-thought prompting, and is evaluated on a privacy problem to check whether a mobile app scenario and associated design description satisfies eight consent requirements from the EU General Data Protection Regulation (GDPR). The overall results show that GPT-4 can be used to verify requirements satisfaction with 96.7% accuracy and dissatisfaction with 93.2% accuracy. Inverting the requirement improves verification of dissatisfaction to 97.2%. Chain-of-thought prompting improves overall GPT-3.5 performance by 9.0% accuracy. We discuss the trade-offs among templates, models and prompt strategies and provide a detailed analysis of the generated specifications to inform how the approach can be applied in practice.
more » « less
Full Text Available
Patterns of Inquiry in a Community Forum for Legal Compliance with Privacy Law

https://doi.org/10.1109/REW61692.2024.00039

Santos, Sarah; Haghighi, Sara; Ghanavati, Sepideh; Breaux, Travis D; Norton, Thomas B (June 2024, IEEE)

Full Text Available
Creation and Analysis of an International Corpus of Privacy Laws.

Gupta, Sonu; Gopi, Geetika; Balaji, Harish; Poplavska, Ellen; O’Toole, Nora; Arora, Siddhant; Norton, Thomas; Sadeh, Norman; Wilson, Shomir (June 2024, roceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024))

The landscape of privacy laws and regulations around the world is complex and ever-changing. National and super-national laws, agreements, decrees, and other government-issued rules form a patchwork that companies must follow to operate internationally. To examine the status and evolution of this patchwork, we introduce the Privacy Law Corpus, of 1,043 privacy laws, regulations, and guidelines, covering 183 jurisdictions. This corpus enables a large-scale quantitative and qualitative examination of legal focus on privacy. We examine the temporal distribution of when privacy laws were created and illustrate the dramatic increase in privacy legislation over the past 50 years, although a finer-grained examination reveals that the rate of increase varies depending on the personal data types that privacy laws address. Our exploration also demonstrates that most privacy laws respectively address relatively few personal data types. Additionally, topic modeling results show the prevalence of common themes in privacy laws, such as finance, healthcare, and telecommunications. Finally, we release the corpus to the research community to promote further study.
more » « less
Full Text Available
Creation and Analysis of an International Corpus of Privacy Laws

Gupta, Sonu; Gopi, Geetika; Balaji, Harish; Poplavska, Ellen; O'Toole, Nora; Arora, Siddhant; Norton, Thomas; Sadeh, Norman; Wilson, Shomir (May 2024, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation)

The landscape of privacy laws and regulations around the world is complex and ever-changing. National and super-national laws, agreements, decrees, and other government-issued rules form a patchwork that companies must follow to operate internationally. To examine the status and evolution of this patchwork, we introduce the Privacy Law Corpus, of 1,043 privacy laws, regulations, and guidelines, covering 183 jurisdictions. This corpus enables a large-scale quantitative and qualitative examination of legal focus on privacy. We examine the temporal distribution of when privacy laws were created and illustrate the dramatic increase in privacy legislation over the past 50 years, although a finer-grained examination reveals that the rate of increase varies depending on the personal data types that privacy laws address. Our exploration also demonstrates that most privacy laws respectively address relatively few personal data types. Additionally, topic modeling results show the prevalence of common themes in privacy laws, such as finance, healthcare, and telecommunications. Finally, we release the corpus to the research community to promote further study.
more » « less
Full Text Available
Fighting the Fog: Evaluating the Clarity of Privacy Disclosures in the Age of CCPA

https://doi.org/10.1145/3463676.3485601

Chen, Rex; Fang, Fei; Norton, Thomas; McDonald, Aleecia M.; Sadeh, Norman (November 2021, WPES '21: Proceedings of the 20th Workshop on Workshop on Privacy in the Electronic Society)

Full Text Available
Breaking Down Walls of Text: How Can NLP Benefit Consumer Privacy?

https://doi.org/10.18653/v1/2021.acl-long.319

Ravichander, Abhilasha; Black, Alan; Norton, Thomas; Wilson, Shomir; Sadeh, Norman (August 2021, ACL)

Privacy plays a crucial role in preserving democratic ideals and personal autonomy. The dominant legal approach to privacy in many jurisdictions is the “Notice and Choice” paradigm, where privacy policies are the primary instrument used to convey information to users. However, privacy policies are long and complex documents that are difficult for users to read and comprehend. We discuss how language technologies can play an important role in addressing this information gap, reporting on initial progress towards helping three specific categories of stakeholders take advantage of digital privacy policies: consumers, enterprises, and regulators. Our goal is to provide a roadmap for the development and use of language technologies to empower users to reclaim control over their privacy, limit privacy harms, and rally research efforts from the community towards addressing an issue with large social impact. We highlight many remaining opportunities to develop language technologies that are more precise or nuanced in the way in which they use the text of privacy policies.
more » « less
Full Text Available
Breaking Down Walls of Text: How Can NLP Benefit Consumer Privacy?

https://doi.org/10.18653/v1/2021.acl-long.319

Ravichander, Abhilasha; Black, Alan W; Norton, Thomas; Wilson, Shomir; Sadeh, Norman (January 2021, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing)

Privacy plays a crucial role in preserving democratic ideals and personal autonomy. The dominant legal approach to privacy in many jurisdictions is the “Notice and Choice” paradigm, where privacy policies are the primary instrument used to convey information to users. However, privacy policies are long and complex documents that are difficult for users to read and comprehend. We discuss how language technologies can play an important role in addressing this information gap, reporting on initial progress towards helping three specific categories of stakeholders take advantage of digital privacy policies: consumers, enterprises, and regulators. Our goal is to provide a roadmap for the development and use of language technologies to empower users to reclaim control over their privacy, limit privacy harms, and rally research efforts from the community towards addressing an issue with large social impact. We highlight many remaining opportunities to develop language technologies that are more precise or nuanced in the way in which they use the text of privacy policies.
more » « less
Full Text Available
From Prescription to Description: Mapping the GDPR to a Privacy Policy Corpus Annotation Scheme

Poplavska, Ellen; Norton, Thomas B.; Wilson, Shomir; Sadeh, Norman (January 2020, Frontiers in artificial intelligence and applications)
null (Ed.)
The European Union’s General Data Protection Regulation (GDPR) has compelled businesses and other organizations to update their privacy policies to state specific information about their data practices. Simultaneously, researchers in natural language processing (NLP) have developed corpora and annotation schemes for extracting salient information from privacy policies, often independently of specific laws. To connect existing NLP research on privacy policies with the GDPR, we introduce a mapping from GDPR provisions to the OPP-115 annotation scheme, which serves as the basis for a growing number of projects to automatically classify privacy policy text. We show that assumptions made in the annotation scheme about the essential topics for a privacy policy reflect many of the same topics that the GDPR requires in these documents. This suggests that OPP-115 continues to be representative of the anatomy of a legally compliant privacy policy, and that the legal assumptions behind it represent the elements of data processing that ought to be disclosed within a policy for transparency. The correspondences we show between OPP-115 and the GDPR suggest the feasibility of bridging existing computational and legal research on privacy policies, benefiting both areas.
more » « less
Full Text Available

« Prev Next »

Search for: All records